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INTRODUCTION 


Several  types  of  data  adaptive  detection  structures*  have  been  proposed  for  use  in 
environments  where  the  signal  and  noise  statistics  are  only  vaguely  known  and  where  insuf- 
ficient a priori  information  is  available  to  design  optimal  detectors.  Several  types  ot  adaptive 
mean  level  detectors  have  been  implemented  to  provide  improved  performance  over  conven- 
tional nonadaptive  systems  when  the  noise  statistics  are  nonstationary.*  —>^'4,5,6  other 
forms  of  adaptive  receivers  based  on  pattern  recognition  techniques,  decision  directed  feed- 
back, nonparametric  statistics  and  sequential  processing  are  described  in  References  7 
through  14. 

Adaptive  implementations  of  linear  prediction  filters  have  recently  been  proposed 
for  spectral  analysis,  instantaneous  frequency  estimation  and  speech  encoding  applications 
(References  1 5 through  20).  A discussion  of  the  properties  and  diverse  applications  of  linear 
prediction  filters  is  contained  in  Reference  21.  The  computational  simplicity  ol  adaptive 
linear  prediction  (ALP)  implementations  based  on  the  Widrow-Hoff  least  mean  squares 
(LMS)  adaptive  algorithm  is  discussed  in  References  15,  16,  17  and  18. 

The  application  of  LMS  implementations  of  ALP  filters  to  maximum  entropy  spectral 
analysis  is  described  in  References  15  and  18.  The  improved  frequency  resolution  which  is 
often  attainable  by  the  maximum  entropy  method  (MEM)  of  spectral  analysis  is  well  docu- 
mented in  References  21  through  26.  Baggeroer  has  shown  however  that  the  high  variance 
associated  with  inverting  the  quantity  |l  - H(co)|",  where  H(cj)  is  the  transfer  function  ot 
the  linear  predictive  filter,  can  lead  to  increased  confidence  intervals  for  MEM  estimates  of 


sinusoidal  components  in  noise. 

It  has  beenshown^^  that  for  multiple  sinusoids  in  uncorrelated  noise  the  magnitudes 
of  the  MEM  frequency  estimates  at  the  sinusoidal  frequencies  degrade  approximately  as 
the  square  of  the  input  signal-to-noise  ratio  (SNR)  in  the  limit  of  large  prediction  filter 
lengths.  Satorius  and  Zeidler^^  have  shown  further  that  the  resolution  attainable  by  MEM 
is  frequency  dependent  when  the  sinusoids  are  embedded  in  an  additive  correlated  noise 
background.  Marple"^  has  shown  that  the  two-sinusoid  resolution  performance  of  MEM 
degrades  as  a function  of  input  SNR  to  a resolution  which  is  no  better  than  that  attainable 
by  conventional  spectral  analysis  techniques  at  low  input  SNR. 

Alternate  approaches  to  frequency  estimation  at  low  input  SNR  using  an  ALP  as  a 
prefilter  to  a conventional  spectrum  analyzer  are  discussed  in  References  16  and  1 7.  In  this 
case  the  ALP  filter  is  used  to  estimate  the  frequencies  at  which  coherent  energy  is  present  in 
the  input  data  sequence  and  the  spectrum  analyzer  examines  the  frequency  content  of 
either  the  adaptive  filter  weights  or  the  adaptive  filter  output  as  illustrated  in  Figure  1 . The 
ALP  filter  can  be  used  to  suppress  either  correlated  or  uncorrelated  noise  components  of 
the  input  data  by  selecting  the  prediction  distance  (A  in  Figure  1 ) on  the  basis  of  available 
a priori  information  concerning  the  expected  autocoherences  of  noise  and  signal  components. 
Suppression  of  the  additive  noise  prior  to  spectral  analysis  reduces  the  interference  between 
the  signal  and  noise  components  of  the  data  before  the  spectral  estimate  is  made.  Appro- 
priate selection  of  A provides  bandwidth  selective  spectral  estimates.”®  This  device  has  been 
called  the  adaptive  line  enhancer  (ALE).  The  LMS  algorithm  provides  a method  for  updat- 
ing the  ALP  filter  coefficients  to  track  temporal  variations  in  the  signal  and  noise  statistics. 


An  adaptive  detection  system  is  one  in  which  the  detectoi  stiuctuie  is  based  to  some  extent  on  the 
leceived  data  and  which  incorporates  a mechanism  to  vary  the  detectoi  stiucture  as  the  received 
signal  and  noise  characteristics  evolve  in  time. 


(a)  ALE  weight  vector 


(b)  ALE  output  sequence 

Figure  1.  Adaptive  detector  implementations. 


The  detection  performance  of  a data  adaptive  receiver  in  a nonstationary  signal 
and  noise  environment  is  a difficult,  if  not  intractable,  problem.*'^  The  devices  proposed  in 
References  10  through  14  are  not  readily  amenable  to  mathematical  analysis  and  their  per- 
formance generally  has  been  defined  empirically  on  the  basis  of  computer  simulation.  In 
cases  where  the  statistics  of  the  signal  and  noise  spectra  evolve  slowly  in  time,  however,  local 
stationarity  can  be  invoked. In  these  cases  it  is  often  possible  to  define  optimal  receiver 
configurations  for  the  specified  operating  conditions.  Steeson^  and  Dillard^  discuss  perform- 
ance measures  of  an  adaptive  mean  level  detector  relative  to  that  of  the  optimal  detector  for 
specified  stationary  inputs  as  a function  of  the  response  time  of  the  mean  level  detector. 

Another  performance  criterion  known  as  the  asymptotic  relative  efficiency  (ARE)^® 
is  sometimes  used  to  evaluate  the  performance  of  an  adaptive  receiver.  * * The  ARF,  is  a 

comparative  measure  of  the  number  of  data  samples  required  by  a given  detector  to  achieve 
the  same  detection  probability  and  false  alarm  rate  as  the  optimal  detector  for  specified  test 
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conditions.  Nolte°’^  has  shown  that  for  the  case  of  detecting  one-of-M  known  synchronous- 
recurrent  transients  in  Gaussian  uncorrelated  noise,  the  optimal  detector  may  be  implemen- 
ted through  an  adaptive  sequential  implementation. 

For  time-evolving  noise  and  signal  statistics  an  effective  adaptive  receiver  must  ex- 
hibit robust  detection  performance  over  the  expected  range  of  inputs  statistics.  Even  in  this 
case,  however,  performance  analysis  based  on  local  stationarity  assumptions  gives  baseline 
performance  data  for  selecting  desirable  design  parameters  of  adaptive  detectors  and/or 
estimators.  For  example,  the  ability  of  the  ALE  to  estimate  the  frequency  of  a sinusoid  in 
uncorrelated  noise  was  shown  to  be  dependent  on  the  adaptive  filter  length  and  the  adaptive 
time  constant.  The  estimation  of  the  freq^uencies  of  multiple  sinusoids  in  uncorrelated 

noise  was  shown  to  be  dependent  on  A as  well.^°  Even  more  dramatic  performance  varia- 
tions as  a function  of  A can  be  expected  when  operating  in  a correlated  noise  environment. 

Using  an  analysis  based  on  deflection  statistics,  comparative  detection  performance 
of  both  ALE  implementations  relative  to  that  of  a conventional  nonadaptive  detection 
system  was  considered  by  Burdic^“  for  sinusoidal  signals  in  both  stationary  and  nonstationary 
white  noise.  There  it  is  shown  that  both  implementations  of  the  ALE  reduce  the  degradation 
in  system  performance  due  to  variations  in  noise  statistics.  Reeves  has  derived  receiver  oper- 
ating characteristic  (ROC)  detection  curves  of  the  ALE  weight  implementation  for  the  signal 
known  except  for  phase  (SKEP)  detection  problem.  (In  the  SKEP  problem,  the  input  signal 
is  a sinusoid  of  known  frequency  and  unknown  phase  in  stationary  white  noise  of  known 
power.)  Under  the  condition  that  the  total  length  of  the  input  data  sequence,  N,  is  much 
larger  than  L,  Reeves  compares  the  detection  performance  of  the  ALE  weight  implemen- 
tation to  that  of  the  classical  optimal  detector  and  incoherently  averaged  DFTs  of  length 
L.  Tire  optimal  detector  is  well-known  14,36  and  may  be  shown  to  be  an  N-length  coherent 
DFT  which  processes  the  entire  N-length  sequence.  The  results  show  that  the  detection 
performance  of  a single  L-weight  ALE  detector  evaluated  at  the  N*^-point  of  the  input 
data  sequence  compares  favorably  with  N/L  incoherent  averages  of  an  L point  DFT.  In 
addition,  a methodology  was  derived  for  defining  optimal  time  constants  of  the  ALE  weight 
implementation. 

This  paper  investigates  the  detection  performance  of  the  ALE  output  implementation 
for  the  SKEP  detection  problem  and  presents  comparative  performances  of  the  two  distinct 
ALE  implementations  for  this  case. 


ADAPTIVE  LINE  ENHANCER  (ALE)  PERFORMANCE 
FOR  SINUSOIDAL  INPUTS  IN  WHITE  NOISE 

As  illustrated  in  Figure  1 , the  ALE  consists  of  an  L-tap  linear  prediction  filter  in 
which  the  filter  coefficients  are  updated  at  the  input  sampling  rate  using  the  Widrow-Hoff 
LMS  adaptive  algorithm.  The  adaptive  filter  output  r(k)  is  defined  by 

L-1 

r(k)=  2 Wj(k)x(k-j-A)  (1) 

j = 0 

where  A is  the  prediction  distance  of  the  filter.  The  instantaneous  estimation  error  is  desig- 

1 f\  ^5^  ^ 

nated  as  e(k),  where  e(k)  = x(k)  - r(k).  It  has  been  shown  that  the  set  of  weight 
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coefficients  which  minimizes  the  expected  (mean)  square  error,  Ele-fk)) , of  the  LMS 
adaptive  filter  can  often  be  obtained  from  the  discrete  Wiener-Hopf  matrix  equation 

Rw*  = P..  (2) 

in  Equation  2,  R is  the  L X L autocorrelation  matrix  of  the  input  sequence  x(k),^is  the 
L X 1 cross-correlation  vector  with  elements  Pj  = E[x(k)x(k  + j + A - 1)1  and  the  w*  is  the 
optimal  least  mean  squares  weight  vector.  The  LMS  algorithm  is  a gradient  search  method 
which  iteratively  seeks  a solution  to  Equation  2 by  updating  each  filter  weight  by  the  recur- 
sive relation 

Wj(k  + 1)  = Wj(k)  + 2/ie(k)x(k-j-A)  (3) 

where  ju  is  a feedback  gain  controlling  stability  and  convergence  of  the  algorithm.  For 
uncorrelated  stationary  inputs,  it  has  been  shown that  the  expected  value  of  the  Wj(k) 
in  Equation  3 converges  in  the  mean  to  the  optimal  wj"  provided  that  n is  bounded  by 
0 < /i  < 1/^rnax  where  X^iax  largest  eigenvalue  of  the  autocorrelation  matrix  R. 

Since  the  LMS  algorithm  approximates  the  true  LMS  error  gradient  with  a finite  data  length 
estimate  of  the  gradient,  the  weight  vector  coefficients  thus  are  obtained  from  noisy  estimates 
of  the  optimal  Wiener  filter  solution.  This  gradient  estimation  noise  is  often  called  mis- 
adjustment  noise  and  is  given  by  Vj(k),  where 

Vj(k)  = Wj(k)  - Wj*  . (4) 

Widrow  et  al*  ^ have  shown  that  this  excess  error  may  be  made  arbitrarily  small  by  decreas- 
ing the  feedback  parameter  m.  Decreasing /a  increases  the  convergence  time  of  the  LMS 
algorithm,  however,  and  a compromise  between  the  response  time  and  the  misadjustment 
noise  power  must  be  made  in  practical  applications.  Convergence  properties  of  the  algorithm 
for  correlated  stationary  inputs  in  general  are  very  difficult  to  derive  analytically  and  a 
summary  of  results  in  this  area  is  presented  in  Reference  28.  In  many  practical  applications 
involving  correlated  data,  however,  the  expectation  of  Wj(k)  converges  within  an  excellent 
approximation  to  wj^  provided  jU  is  sufficiently  small. Reference  28  has  shown  that  ex- 
perimentally obtained  steady-state  coefficients  Wj(k)  for  inputs  consisting  of  sinusoids  in 
uncorrelated  noise  are  in  good  agreement  with  the  theoretical  optimal  coefficients  Wj*. 

It  is  generally  assumed*^  that  the  misadjustment  noise  is  Gaussian  distributed  and 
uncorrelated  from  weight  to  weight.  Further,  it  was  shown  for  uncorrelated,  stationary  inputs 
that 


ElVj(k)l  =0 

(5a) 

E[Vj{k)Vj(k)|  = Mc“6(i  -j) 

(5b) 

where  6(  ) is  the  Kronecker  delta  operator.  Further  it  will  be  assumed  that  the  variance  of 
the  misadjustment  is  essentially  constant  after  the  start  of  adaptation  and  is  not  dependent 
upon  either  signal  presence  or  absence.  Analytical  solutions  for  wj"  for  a single  sinusoid  in 
uncorrelated  noise  have  been  derived  by  Triechler"^  and  Reeves^^  for  a sinusoid  of  known 
frequency  under  the  condition  that 

— — = integer  (0) 

lit 
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where  is  normalized  relative  to  the  sampling  frequency  of  2^  radians.  When  Equation  6 
is  valid  and  the  input  is  given  by 

x(k)  = A sin  (cj^k  + 0)  + n(k)  (7) 

where  n(k)  is  a zero-mean  white  Gaussian  noise  secuvnce  power  u",  the  R matrix  is  circulant 
and  it  can  readily  be  shown-^’-^-^  that 

wj*  =-:^cos  [cJqU -H  A)1  . 0 <j  < L - 1 . (8) 


In  Equation  8, 


, * ^ (L/2)SNR 
**  l-i-(L/:)SNR 


(9) 


where  SNR  is  the  input  signal-to-noise  ratio  defined  as  SNR  = A"/2u".  Note  that  the  ampli- 
tude of  Wj*  depends  nonlinearly  on  the  input  SNR.  This  effect  is  due  to  the  difficulty  in 
separating  the  narrowband  and  broadband  components  of  x(k)  near  at  low  SNR.  wj  is 

defined  in  Reference  28  for  a single  sinusoid  in  white  noise  for  the  more  general  case  in  ■ 

which  R is  not  a circulant  matrix.  It  is  shown  that  Equation  8 also  provides  an  excellent  i 

approximation  to  W;  for  tt/L  < oj„  < 7r((L  -1  )/L)  when  R is  not  circulant. 

Triechler”^  has  shown  that  the  speed  of  adaptation  of  the  LMS  algorithm  is  control- 
led by  the  dominant  eigenvalues  of  the  R matrix.  For  the  case  of  a single  sinusoid  in  un- 
correlated noise,  the  optimal  weight  vector  is  orthogonal  to  all  eigenvectors  of  R except  the 
conjugate  pair  of  eigenvectors  corresponding  to  frequency  cJq.  The  eigenvalues  associated 
wiUi  these  eigenvectors  dominate  the  convergence  time  and  are  both  given  by  X = u“  + 

(A“L/4).  As  shown  in  Reference  20,  when  the  filter  is  filled  with  data  and  all  weights  are 
zeroed  at  k=0,  the  mean  value,  Wj(k),  of  the  weight  updates  by  the  relation 

Wj(k)  = [ 1 - ( 1 - 2pX)'^|  Wj*  ;k  = 0,l,...  . (10) 

When  2pX  < 1 and  k > 1 , Equation  10  reduces  to 

Wj(k)  = [ 1 - Wj^  (11) 

In  this  case  the  mean  adaptation  time  constant  is  given  by  = 1 l2^lK.  From  Equations 
J 8,  10  and  1 1 , the  dynamic  solution  for  the  adaptive  filter  weights  is  thus  given  in  the  mean 

r by 

Wj(k)=  cos  lcjQ(j-tA)l,0^j<L- 1 (12) 

I where 

a(k)=  |l  -e"'^'’^"’]  a*  . (13) 
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DERIVATION  OF  DETECTION  STATISTICS 


DETECTION  STATISTICS  FOR  T TE  ALE  OUTPUT  DEVICE 

The  detection  performance  of  the  ALE  filter  output  implementation  will  be  analyzed 
under  the  condition  that  all  adaptive  filter  weights  are  initialized  to  zero  at  k = 0 and  that 
the  filter  processes  N data  points  prior  to  forming  a detection  statistic.  The  adaptive  filter 
and  the  K-point  DFT  detector  are  restricted  such  that  K and  L are  much  less  than  N.  The 
restriction  that  (K,L)  <€  N eliminates  initial  transient  behavior  as  the  adaptive  filter  fills  with 
data  and  allows  the  mean  weight  approximation  given  by  Equation  1 2.  Detection  is  based 
on  a K-point  DFT  of  the  final  K output  values  from  the  L-length  fi’cer  and  will  be  perform- 
ed using  the  classical  Neyman-Pearson  hypothesis  test.  Two  hypotheses,  Hq  and  H] , will  be 
defined  by  Hq  = x(k)  + n(k)  and  Hj  : x(k)  = s(k  | d)  -t  n(k).  Here  s(k  10)  is  the  sinusoidal 
signal  of  known  amplitude  and  frequency  but  unknown  phase  and  nfk)  is  a sample  of  the 
Gaussian  uncorrelated  noise  sequence  with  zero  mean  and  variance  v : 

s(k  I 0)  = A sin  (cj^k  + 0)  , (14) 

E[n(k)n(p)]  = u^6(k  - p)  (15) 

where  v~  is  known  a prion. 

In  this  section  a detection  statistic  q will  be  defined  for  the  ALE  output  device  and  the 
conditional  probability  density  functions,  PQ(q)  and  Pi(q)  will  be  derived.  The  Neyman- 
Pearson  criteria  allows  the  probability  of  false  alarm,  Pp^,  under  Hq  to  be  completely 
specified  in  terms  of  fixed  threshold  /3q  such  that  if  q > ^q,  signal  presence  is  declared, 
whereas  if  q <|3q,  the  decision  of  signal  absence  is  made. 

Since  s(k  | 0)  in  Equation  14  is  completely  deterministic  and  zero-mean,  the  vari- 
able x(k)  is  a zero-mean  Gaussian  random  variable  for  all  values  of  k.  The  filter  output 
conditioned  on  the  unknown  initial  phase  is  thus  defined  by  Equation  1 . Further,  since 
r(k  I 0)  is  a linear  combination  of  previous  input  samples  it  is  also  a zero-mean  Gaussian 
random  variable.  The  output  spectrum  obtained  from  a K-point  DFT  of  r(k  | 0)  evaluated 
at  cOq  is  given  by  R(cJq)  = u -h  jv,  where 

K-1 

u = 2 r(k  I 0 ) cos  cOpk, 
k=0 

K-1 

v=  2 r(k  I 0)  sin  oj^k  . (16) 

k=0 

The  detection  statistic  q is  then  defined  as*^’^^  q = u^  + v“.  The  joint  pdf  for  (u,v)  must, 
therefore,  be  specified  to  define  the  pdf  of  the  detection  statistic  q.  Medaugh^'^  has  shown 
that  u and  v are  essentially  statistically  independent  for  the  signal  and  noise  models  assumed 
above  for  k > Since  r(k  | 0)  is  a Gaussian  random  variable,  u and  v are  also  Gaussian 
and  the  pdf  p(u,v)  is  written  as 


p(u.v)=i:;ra^,oJ->  exp  j-^  + ( ^)'] 


where  ii  and  v are  the  component  means  and  a-  and  a“  are  the  component  variances.  The 
values  of  iT  V,  a-  , a-  were  derived  for  the  steady-state  adaptive  filter  response  for  equal 
DFT  length  and  adaptive  filter  length  (K=l  ) in  Reference  34  and  extended  for  the  case 
K < L in  Reference  35.  We  will  consider  the  ALl-  detection  performance  for  K < L. 

F'or  TT'  L < cJq  < 7rl(  L -1  )/L|  , the  expressions  for  u and  v are  shown  in  Reference  35 
to  be  closely  approximated  by 

u = a* AK- sin  0,'2L.  (18a) 

V = a*AK“  cos  0/2L  . (18b) 

Likewise,  for  low  input  SNR^^ 


Ou  = Oy  = 


/iC“v“LK 


/ac“A"LK." 


2a*-u-K- 


\4  I -)  / 


where  c"  is  the  total  input  power,  signal  plus  noise.  Note  that  in  addition  to  raising  the 
mean  level  of  the  bin  component  as  indicated  bv  Lquation  18.  the  presence  of  the  il 
markedly  increases  the  variance  of  each  DFT  component  through  the  appearance  of  A and 
a*  in  Equation  14.  (For  the  weight  transform  implementation,  the  variances  of  u and  v are 
identical  under  Hq  and  H| . Reference  33).  The  raising  of  the  mean  bin  level  through  Equa- 
tion 18  serves  to  shift  pj  (q)  higher  on  the  q axis,  while  the  larger  variance  will  serve  to 
broaden  the  pdf.  These  effects  signify  that  P]  (q)  is  more  highly  skewed  toward  larger  q 
values  than  is  Po(q).  thus  enhancing  detection  over  the  case  if  P|(q)  were  simply  a 
mean-shifted  replica  ot  PQ(q). 

By  ensuring  that  N > and  that  (L.K)  < N,  a*  in  Equations  18  and  19  can  he  re- 
placed by  a(k)  to  describe  the  output  statistics  just  prior  to  convergence.  This  signifies 
the  means  of  Equation  1 8 build  up  exponentially  with  time  constant  T|^.|  to  their  converged 
values,  as  shown  by  Triechler.-*^  Thus,  Equation  18  may  be  rewritten  as 


where 


u = A sin  0.  v = A cos  6 


~ atklAK" 


Further,  in  this  case  the  variance  under  H|  is  now  approximated  h>  ay  = 0~  = o~.  where 
from  Fkiuation  19 


/jc“u"LK 


^ Mc~A~LK-  ^ 2a~)k)u~K- 
8 


(--  -)  • 
\ 4 1 / 


File  first  two  components  ofay  are  independent  of  time  index  k due  to  the  assumption 
that  the  weight  misadjustment  noise  is  independent  of  k as  defined  by  Equation  5,  The 
first  is  due  to  the  broadband  weight  misadjustment  noise  produced  by  the  input  white 
noise.  The  second  component  ofay  is  the  weight  misadjustment  caused  by  presence 


of  the  signal  and  is  due  to  a narrowband  noise  process  centered  at  frequency  co^.  This 
narrowband  noise  process  may  be  interpreted  as  adding  a randomly-phased  sinusoid  with 
frequency  co^  to  the  mean  weight  vector  and  the  magnitude  of  this  component  defines  the 
variance  of  the  envelope  of  the  sinusoid.  The  third  component  of  the  variance  is  due  to  the 
input  white  noise  in  the  bandwidth  of  the  adaptive  filter  and  increases  in  magnitude  with 
increasing  k as  the  mean  weight  vector  increases  as  defined  by  Equation  12.  Note  that  the 
magnitude  of  this  term  is  scaled  by  the  convergeijce  rate  of  the  mean  weight  vector. 

With  IT  and  T given  by  Equation  20  and  ay  given  by  Equation  2 1 the  derivation  ot 
the  pdfs  for  the  detection  variable  q is  straightforward and  is  outlined  in  Appendix  A. 
The  results  show  that  under  Hq,  pQ(q)  is  the  gamma  density  function  with  two  degrees 
of  freedom,  given  by 


Po<0)= — 7 «XP  -q/2ag  ,q>0 


In  Equation  22,  a“  is  the  variance  of  u and  v under  Hq,  which  is  obtained  from  Equation  21 
by  setting  A = 0 and  a*  = 0;  i.e.,  a“  = /ac“u“LK/2.  Under  Hj , q has  the  density  function 
P|(q)  given  by 

Pl(q)=  -l^exp  - -i_(q+A^)  Iq  ( ^ , q > 0,  (23) 

2ay  [ J \ ^1  / 

where  1q(x)  is  the  modified  Bessel  function  of  the  first  kind. 

Following  the  Neyman-Pearson  criteria,  the  Pp\  is  specified  and  used  to  obtain  the 
threshold  |3q  under  Hq; 

OO 

/ pQ(q)dq  = Pp;^  (-4) 

where  Pglq)  is  given  by  Equation  22.  Since  Pp^  and  Pq^Q)  “irc  known.  Equation  24  com- 
pletely specifies  the  threshold  (3^.  The  probability  of  detection,  Pq,  is  then  simply  the 
probability  that  the  variable  q exceeds  this  same  threshold  when  the  signal  is  present, 
given  by: 

OO 

Pd=  / P|(q)dq  <-5) 

^5o 

with  P|(q)  given  by  Equation  23. 

A change  of  variables  will  transform  the  integrals  in  Equations  24  and  25  into  more 
conventional  forms.  It  may  be  shown^^  that  equivalent  forms  of  these  integrals  are 


OO  ^ 

p,..=  / X3(p)dp 

vA  f 
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and 


Pd=  f x5(Pla)'Jp  <27) 

Tl 

■)  T' 

where  the  notation  Xt(P)  is  the  ehi-square  pdf  with  2 d.o.f.  and  x^  (P  I is  the  non-central 
chi-square  pdf  with  2 d.o.f.  and  non-centrality  parameters,  where  from  Fquations  20  and  21 


K^a-(k)SNR/iuc^L^ 


1 -1-  — SNR  -i- 

(-L- 

K 

9,  'J 

12 

(28a) 


Specification  of  in  Equation  26  completely  determines  the  threshold  T^.  This  threshold 
is  then  scaled  by  the  ratio  of  the  variances  under  Hq  and  Hj  to  define  the  threshold  T j by 


T 


1 " 


1 + — SNR  + 

7 


4Ka"(k)  II 
Mc2l3  \4 


(28b) 


Tlie  probability  integrals  Equations  26  and  27  may  be  evaluated  by  a non-numerical  method 

•5  *7 

developed  by  Urkowitz.-^ 


DETECTION  STATISTICS  OF  THE  ALE  WEIGHT  VECTOR 
IMPLEMENTATION 


Reeves^^  has  examined  the  case  of  detection  based  on  the  ALE  weight  vector  imple- 
mentation for  the  SKEP  problem.  Detection  is  based  upon  thresholding  a random  variable 
defined  to  be  the  magnitude-square  of  the  coQ-component  of  the  L-point  DET  of  the  final 
weight  vector.  Reeves  assumes  that  the  mean  ALE  weights  are  described  by  Equation  1 2 
and  that  Equation  7 is  also  a valid  description  of  misadjustment  noise.  These  lead  to  a deri- 
vation^^ of  integrals  which  have  the  forms 


and 


(2d) 


P 


D“ 


OO 

/ X5  (z  I 


(30) 


■>  'I 

where  = a‘'(k)/a^  , and  is  one-half  the  real  weight  variance  of  the  to -component, 
derivable  from  Reference  1 6 as  = mc“L/2.  Note  that  the  thresholds  in  each  integral  are 
Tq,  due  to  equal  variance  of  the  weight  vector  detection  statistic  under  and  H | The 


1 1 


integrals  Equations  29  and  30  may  also  be  non-numerically  evaluated  by  the  methods  out- 
lined in  Appendix  B. 

DETECTION  STATISTICS  FOR  INCOHERENTLY  AVERAGED 
DFT  PROCESSOR 

For  this  commonly  used  processor,  the  N-length  input  sequence  is  divided  into  M 
blocks  of  K-points  each  and  a coherent  K-point  DFT  is  fonned  for  each  block.  I he  block 
DFTs  are  then  averaged,  forming  the  detection  statistic  u (normalized  to  unity  variance), 
which  is  the  magnitude  square  of  the  component.  The  statistic  u under  has  been 
shown  to  be^^’^^  the  chi-squared  density  function  with  2M  degrees  of  freedom  due  to  the 
2M  random  variables  added  together  to  form  u.  This  defines  the  threshold  by  the 
integral  relationship 

OO 

Po 

Under  Hj,  the  detection  statistic  u has  the  non-central  chi-squared  density  function  with  2M 
degrees  of  freedom  and  non-centrality  parameter  Oj,  where  Oj  = N (SNR)  from  Reference  33. 
Since  the  statistical  variance  on  u is  not  dependent  upon  signal  presence  or  absence,  the 
threshold  under  H-  is  equal  to  |3q  giving  the  following  expression  for  Pd^ 

OO 

P0=  j X2M I 

/5o 

The  threshold  in  Equation  31  and  the  evaluation  of  Pq  in  Equation  32  may  be  found 
by  the  methods  presented  in  Appendix  B. 

PERFORMANCE  ANALYSIS 

The  detection  performance  of  the  adaptive  implementations  and  the  conventional 
DFT  processor  of  the  previous  section  will  be  compared  on  the  basis  of  ROC  analysis.  A 
value  of  Pp^  is  specified  and  held  constant  for  a set  of  hypotheses  tests,  while  the  input 
signal  strength  is  varied.  The  results  are  sets  of  curves  which  give  Pd  as  a function  of  input 
SNR  for  different  processor  configurations. 

There  is  a fundamental  tradeoff  in  selecting  appropriate  adaptive  filter  time  con- 
stants since  reducing /a  decreases  the  misadjustment  noise  power  as  indicated  by  Equation  5, 
but  correspondingly  increases  convergence  time  as  indicated  by  Equation  1 1.  Reeves  has 
shown  that  there  is  an  optimal  value  of /a  which  maximizes  the  probability  of  detection  at 
specified  values  of  N,  L and  SNR  for  the  ALE  weight  implementation.  Assuming  a low 
input  SNR  (c“  2:  u“)  and  that  detection  is  based  on  the  last  weight  vector  (k  = N).  the 
optimal  value  of  ju  was  shown  to  be: 


12 


(33) 


i 


1.25643 

L A 

2c-N(l  + -^SNR) 

A 

where  SNR  is  a nominal  input  SNR  which  may  be  expected  at  the  receiver  input  and  the 
numerator  of  liquation  33  is  the  non-trival  root  of'  the  transcendental  equation 
( 1 + 2x)e“^  = 1 . Reeves  has  shown  that  detector  performance  is  not  strongly  inlluenced  by 
choice  of  SNR.  Medaugh^^  has  similarly  investigated  the  optimality  of /a  for  detection 
based  on  filter  output  for  the  case  K = L and  found  the  same  expression  as  liquation  33 
gives  the  optimal  /a.  These  results  may  be  easily  extended  to  the  case  for  K < L by  simul- 
taneously^ maximizing  the  non-centrality  parameter  a and  minimizing  the  threshold  scaling 
factor  (o“/aT),  both  given  in  liquation  28,  with  respect  to  n.  By  taking  partial  derivatives 
of  cr  and  (Oo/a| )“  with  respect  to  /a  and  equating  to  zero,  it  may  be  shown  that  as  de- 
fined by  liquation  33  is  equivalently  the  optimal  /a  for  detection  based  on  filter  output  witli 
L. 

An  important  property  of  the  ALfi  is  the  relative  insensitivity  of  the  adaptive  filter  to 
its  feedback  gain  parameter  /a.  While  there  is  an  optimal  /a  value  for  a given  fixed  set  of 
parameters,  detection  performance  of  either  ALli  implementation  is  not  critically  dependent 
upon  /a,  as  seen  from  Figure  2.  In  this  figure,  the  SNR  at  which  P|)  equals  50  percent  has 
been  plotted  as  a function  of  ^ for  both  processors  for  some  typical  parameters.  The  Pp^ 
was  held  constant  at  10“^.  The  shape  of  the  curves  for  each  implementation  is  consistent 
for  both  filter  lengths  and  the  broad  minima  show  that  near-optimal  (with  respect  to  /a) 
performance  may  be  obtained  for  a wide  range  of /a  settings. 


' iguic  2.  SNR  al  .SO  [x'lccnt  as  a liinclion  ol  opllnial 
*^0  K = L.PpA=IO-4. 
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Figure  3 displays  Pq  versus  input  SNR  for  the  case  K = L = 1024  and  an  input  data 


r 


-40  dB  -30  dB 

input  SNR 


Figure  3.  ROC  curves  for  Pp^  = 10"^,  N = 102,400. 

K = L=  1024,/ic2  = 4.1  X 10'^. 

sequence  of  N = lOOL.  For  comparison  the  ROC  curve  for  the  optimal  processor  for  this 
case  is  also  shown.  The  optimal  processor  is  a 102400-point  DFT  which  processes  the  entire 
data  record  coherently.  The  ROC  curves  for  the  ALE  implementations  with  m chosen  by 
Equation  33  are  compared  with  that  of  the  incoherently  averaged  DFT  (lAD)  processor 
which  ensemble  averages  a total  of  M = 100  of  the  K-point  DFTs  of  the  input  sequence. 
Performance  of  the  latter  three  processors  is  seen  to  be  nearly  equivalent  in  the  vicinity 
of  Pq  = 50  percent,  with  the  ALE  output  device  giving  slightly  poorer  performance  at 
higher  SNRs,  but  slightly  better  at  lower  SNRs.  This  bending  of  the  output  curve  is  caused 
by  the  components  of  the  filter  output  variance  which  are  due  to  signal  presence,  as  shown 
by  Equation  21 . This  behavior  does  not  occur  in  the  ALE  weight  device  or  the  lAD  proc- 
essor since  the  variances  under  Hq  and  Hj  are  equivalent.  Figure  4 shows  the  case  for  short- 
ening the  filter  and  DFT  lengths  to  K = L = 128  but  retaining  the  relation  N = lOOL.  The 
general  shapes  of  the  curves  are  consistent  with  those  in  Figure  3;  the  primary  difference  is 
in  raising  the  detectable  SNR  levels,  reflecting  the  lower  number  of  data  samples  used. 
Figures  3 and  4 together  demonstrate  the  ability  of  the  ALE  implementations  to  maintain 
consistent  detection  characteristics  over  a wide  range  of  filter  lengths  and  SNRs,  providing 
the  ratio  N/L  remains  constant.  The  effect  of  altering  this  ratio  is  seen  in  Figure  5.  which 
gives  performance  for  K = L = 1024  and  N = 20L. 

Another  important  result  of  adaptive  prefiltering  may  be  seen  in  Figure  6.  in  which 
filter  length  L has  been  held  constant  at  L = 1024  and  K has  been  varied  over  a wide  range 
such  that  K < L.  As  K decreases,  performance  degrades  only  slightly  and  asymptotically 


-40  dB 


-30  dB 
input  SNR 


-20  dB 


Figure  6.  ALE  output  ROC  curves  for  fixed  filter  length, 
L,  and  varying  DFT  length,  K.  Pp^  = 10“^.  L = 1024, 
AIC^  = 4.1  X lO"^. 


approaches  an  ROC  curve  which  is  independent  of  K.  This  is  due  to  the  detection  statistic- 
distribution  under  H|  being  more  strongly  a function  of  L than  K.  This  can  be  seen  by 
making  the  substitution  K = mL  in  Equations  28a  and  28b,  where  m is  the  fraction  (m  < 1 ) 
relating  DFT  and  ALE  length.  Equations  28  may  then  be  rewritten  as: 


m^a-fk)SNR 

/ic“(l  + ^ SNR)+  (3  -m) 


(34a) 


Ti=  ^ T„  . (34b) 

1 + — SNR+  (3-m) 

- 3/JC-L 

Now  let  m approach  zero  in  Equation  34,  signifying  shorter  and  shorter  DFTs  with 
respect  to  ALE  length.  Performing  a limiting  operation  produces  a unit-length  (one-point) 
DFT,  giving  m = L~' . For  practical  filter  lengths  and  SNR  values.  Equations  34  then  are 
closely  approximated  by: 


a-(k)SNR  ^ Q 
juc^  L^ 


I 


From  Fquation  35a  it  is  seen  that  the  non-cciitrality  parameter,  a.  approaches  zero  as  K is 
taken  to  its  lower  limit.  However,  from  Equation  35b  it  is  seen  that  there  is  still  an  appre- 
ciable threshold  reduction  under  H|,  which  is  entirely  a function  of  the  adaptive  filter  para- 
meters. The  traction  a“(k)//ic“L"  has  an  appreciable  magnitude  for  a wide  range  of  filter 
parameters,  thus  providing  the  reduced  threshold  T|  and  the  resultant  gain  in  detection. 
Using  an  adaptive  filter  length  of  L = 1024,  detection  ROC  curves  for  K = 1024,  800,  512 
and  32  are  shown  in  Figure  6.  The  limiting  cases  of  Equations  35  are  approached  quickly, 
as  can  be  seen  from  the  closeness  of  the  curves  for  K = 51  2 and  K = 32. 


SU.MMARY 

This  paper  has  e.xamined  the  detection  properties  of  two  adaptive  detector  implemen- 
tations for  the  SKEP  problem.  Detection  performances  by  ROC  analysis  for  the  L-tap  ALE 
weight  vector  and  filter  output  devices  using  K-point  DFTs  and  processing  N-length  data 
sequences  have  been  shown  to  be  roughly  equivalent  to  incoherent  averaging  of  an  L-point 
DFT  lor  a wide  range  of  K.  L and  N.  The  specific  differences  between  detection  properties 
of  each  ALE  implementation  have  been  described  and  shown  to  be  due  to  the  different 
variance  characteristics  of  each  processor.  The  detection  performance  of  the  ALE  processors 
has  been  shown  to  be  relatively  insensitive  to  the  choice  of  adaptive  feedback  constant  /a. 
Finally,  detection  performance  for  the  two  ALE  implementations  was  also  shown  to  be  not 
crucially  dependent  upon  DFT  detector  length  K. 
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APPENDIX  A 

DERIVATION  OF  PDFs  FOR  ALE  FILTER  OUTPUT 


I'rom  Equation  1 7,  the  joint  pdf  for  the  random  variables  u and  v is  specified  in 
terms  of  their  means  and  variances  under  H| . The  output  statistic  q lias  been  defined  as 
the  sum  of  the  squares  of  u and  v: 

q = u-  + v-  <Ah 

and  thus  it  is  necessary  to  make  the  transformation  of  variables  to  obtain  the  pdfs  of  q ‘rom 
the  joint  pdf  of  (u,v).  This  transformation  is  straightforward  and  follows  the  development 
in  Whalen.* 

First,  the  pdf  of  random  variable  q under  H| , denoted  P|(q).  will  be  derived  and  a 
simple  modification  to  P|(q)  will  allow  us  to  find  easily  the  pdf  of  q under  denoted 
p^tq).  Thus,  from  Equation  17 


I'rom  Equation  19,  it  is  seen  that 


T 

or.  = or  = oy 


and  from  Equation  20  the  component  means  under  H|  are  given  by 
u = A sin  0 


■ = A cos  6 . 


Using  these  relations,  the  joint  pdf  for  (u,v)  given  0,  denoted  by  P|(u,v  | 0).  becomes 


P](u,v|0)=  — exp  |-  — L 

27107  I 2or 


u“  + V-  + A"  - 2A  (u  sin  0 + V sin  0) 


Now  make  the  intermediate  change  of  variables 

V = z cos  .//  , u = 7.  sin  ^ 

This  will  allow  Equation  A5  to  be  transformed  from  the  u,v  coordinates  to  the  coor- 
dinates by  the  transformation* 


P|(Z,l//l0)  = I J I Pi(u,v  I 0) 


u = f(z,</) 
V = gtz,.//) 


where  | J I is  the  determinant  of  the  Jacobian  for  z and  ^ and  is  given  by 


0U 

9z 

3z 

0V 

3 LI 

d4> 

bxlj 

For  these  variables,  I J I = z and  thus,  by  using  a trigonometric  identity.  Equation  A5 
becomes 

P|(z,vl/  I d)  = — ^ exp  I - — ^ 7?  + A“  - 2Az  cos  (6  - \I')  | 

Zrroi  ( 2oy  -I) 

The  dependence  on  the  dummy  variable  \}/  may  be  removed  by  integrating  over  the 
range  of  \p : 


P[(z  1 0)  = j P|(z,\^  I d)d\l/ 


z > , 2 . T'’,  f 

= — ; exp r {z“  + A”) 

2^1  I 2af  J ^ 


exp  — —cos{6-\}/)  dip  . (A7) 


The  integral  in  Equation  A7  is  well  known  to  be 


r~  Az 

I exp  — c 


cos  (0  - 1^)  dvi/  = 27rl  ( — 


-(I  / Az  \ 

“’^'o  — 

\n  / 


where  Iq(x)  is  the  modified  Bessel  function  of  order  zero.  Thus  the  pdf  of  the  statistic  z 
is  no  longer  dependent  upon  0 and  is  given  by 


Pl(z)=  exp  - 


' / ■’  J.  li  / Az\ 

— (Z-  + A-)  lo  — • 

2aT  \ oy  / 


One  more  change  of  variable  must  be  made.  From  Equation  A6  it  is  seen  that 
z“  = u*-  + V-,  which  is  exactly  equal  to  the  detection  statistic  q as  defined  by  Equation  A1 . 
Thus  the  pdf  of  the  desired  statistic  q may  be  obtained  by  the  transformation 

Pl(q)=  ^ P(z) 

z = f(q)  . 


In  this  transformation,  z = q^^and  Hquation  A8  becomes 


1 /Aa'/-\ 

--i^(q  + A-)  lo  ^ 

2ai-  \ / 

. q>0  . 

(Aq) 

1 

1 


Equation  A^)  is  the  pelf  of  the  desired  detection  statistic  q under  H | . The  pdf  of  q 
under  the  liypothesis  of  no  signal  present,  is  easily  obtained  from  Equation  by 
observing  that  for  no  signal  present,  A = 0 from  Equation  20b  and  the  variance  becomes 
a-  as  defined  in  Equation  22.  The  pdf  of  q under  H^,  PQ(q)i  results  from  substituting 
these  values  into  Equation  A^),  giving 


Po(q)  = 


1 


exp 


q X) 


(AlO) 


since  1^(0)  = 1. 
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APPENDIX  B 

EVALUATION  OF  PROBABILITY  INTEGRALS 


To  determine  the  detection  performance  of  the  processors  mentioned  in  this  paper, 
it  is  necessary  to  evaluate  two  integrals  of  the  forms: 


f Xp  (r)dr 


Pd=  f Xpirla) 


In  Equation  B1 , is  the  chi-square  probability  density  function  with  F degrees  of  free- 
dom given  by  Reference  36  in  the  main  text; 

(F-2)/2 

exp  [-r/2)  , r>0  (B3) 

^ 2F/-nF/2) 

where  r(F/2)  is  the  gamm^  function  which  for  integer  values  of  (F/2)  may  be  replaced  by 
tF/2)!  In  Equation  B2,  Xp(r  la)  is  the  non-central  chi-square  pdf  with  F degrees  of  freedom 
and  non-centrality  parameter  a,  given  by 


/ \ <F-2)/4  r j 


>0  . 


Consider  first  Equation  Bl,  whici’.  is  used  to  determine  the  threshold,  T^^,  under 
Hq-  ForF  = 2,  it  is  seen  that  the  integral  in  Equation  Bl  may  be  evaluated  immediately 
to  give  the  relation 

To  = -2lnPpA  • (B5) 

For  F > 2,  however,  direct  integration  of  Equation  Bl  becomes  quite  tedious.  Fortunately. 
Urkowitz**  has  developed  a nomographic  method  for  determining  Tq.  given  and  F, 
and  its  use  is  straightforward.  This  method  is  accurate  for  F < 200,  but  for  F > 200.  the 
pdf  Xp  (r)  is  essentially  Gaussian  and  the  following  approximation  is  valid:  ** 

T — F 

Pfa  - 4 <*^6) 

where  erfc(z)  is  the  complementary  error  function,  given  by 


'>  r -I 

ertc(z)  = — ~ / cxp(-x“) 

■v/TT  J 


*A.D.  Wlialen,  Detection  Signals  in  Noise.  Academic  Press,  New  York,  . 

**llarry  Urkowitz,  “Energy  Detection  of  Unknown  Deterministic  Signals."  Proceedings  ol  II  1 I..  V olume 
Number  4,  April  1967. 
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This  function  has  been  extensively  evaluated  and  appears  in  many  references*. 

Now  consider  the  integral  in  Equation  B2.  The  evaluation  of  Pp  requires  that  the 
non-central  chi-square  pdf  be  integrated  from  a known  threshold  r = T j to  r = «>.  This  is 
equivalent  to  finding  the  quantity 


Pd=!- 


I *>’ 

/ Xplr  ta)dr 


(B8) 


where  the  second  term  of  the  right-hand  side  of  Equation  B8  is  recognized  to  be  the  non- 
central chi-square  cumulative  distribution  function  evaluated  at  r = T| . Urkowitz  has  shown 
that  this  function  may  be  made  equivalent  (through  appropriate  choice  of  arguments)  to 
the  incomplete  Toronto  function**  for  which  tabulated  curves  are  available.  The  incom- 
plete Toronto  function  is  defined  by 


B 


Tg(m,n,p)  = 2p 


n-m+1 


I jm-n  p-t‘ 


ln(2pt)dt  . 


(B9) 


Using  Equations  B4,  B8  and  B9  and  the  relations 


Tl 

II 

CD 

ni 

= F-l 

n 

= (F-2)/2 

P“ 

II 

(BIO) 


One  may  show  that  the  probability  of  detection  Pd  is  given  by 


Pd=i 


T^TYTIIF-  1 ,F/2  - 1 , y/^) 


(BID 


Curves  for  the  incomplete  Toronto  function  for  a number  of  values  of  F are  given  in 
Marcum**,  but  it  is  often  desired  to  calculate  Pd  for  degrees  of  freedom  other  than  those 
selected  in  Marcum.  An  example  of  this  is  the  case  of  incoherent  averaging,  in  which  F;-, 
uncorrelated  statistics  with  two  degrees  of  freedom  are  summed  to  produce  the  final 
detection  statistic  with  F degrees  of  freedom.  In  this  case,  F may  be  too  large  to  be  useful 
in  the  above  method.  However,  Urkowitz  has  developed  a nomographic  method  to  evaluate 
Pq  in  such  cases.  This  method  is  valid  for  F < 200  and  for  F > 200,  the  (laussian  approxi- 
mation may  be  employed  which  produces  the  following  approximation  for  Pd 


D 


— erfc 


1 


F - a 


2VF+  2a 


(B12) 
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